Optimal Web-Scale Tiering as a Flow Problem

Authors

  • Gilbert Leung
  • Novi Quadrianto
  • Alexander J. Smola
  • Kostas Tsioutsiouliklis

Abstract

We present a fast online solver for large-scale parametric max-flow problems as they occur in portfolio optimization, inventory management, computer vision, and logistics.

  • Our algorithm solves an integer linear program in an online fashion.
  • It exploits total unimodularity of the constraint matrix and a Lagrangian relaxation to solve the problem as a convex online game.
  • The algorithm generates approximate solutions of max-flow problems by performing stochastic gradient descent on a set of flows.
  • We apply the algorithm to optimize the tier arrangement of over 80 million web pages on a layered set of caches to serve an incoming query stream optimally.

Motivating Example: The Tiering Problem

Goal:

  • Select documents to be stored in successive tiers or caches of decreasing access frequency,
  • such that frequently accessed documents are found in the highest tiers,
  • so that the search engine can cover incoming queries with low latency and computational load.

One proposed solution:

  • Assign a value to each document and arrange the documents such that the highest-valued ones reside in the highest levels of the cache.
  • But this is sub-optimal.

Reason: to answer a given query well, a search engine returns not a single document but a list of r (typically r = 10) documents.

[Figure: a result list of ten documents (doc 1 to doc 10) shown against cache 1]
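The reason given above can be made concrete with a toy instance. The sketch below is an illustration only, not the authors' flow-based algorithm: the queries, their frequencies, the document names, and the cache capacity are all hypothetical, and a brute-force search stands in for the solver. It assumes a query is served from the cache only when every document in its result list is cached.

```python
from itertools import combinations

# Hypothetical toy data: (query frequency, documents its result list needs).
QUERIES = [
    (5, {"a", "b"}),
    (4, {"c", "d"}),
    (4, {"c", "e"}),
]
DOCS = sorted(set().union(*(docs for _, docs in QUERIES)))


def covered_traffic(cache):
    """Total frequency of queries whose whole result list is in the cache."""
    return sum(freq for freq, docs in QUERIES if docs <= cache)


def tier_by_document_value(capacity):
    """Per-document heuristic: value = summed frequency of queries using the doc."""
    value = {d: sum(f for f, docs in QUERIES if d in docs) for d in DOCS}
    ranked = sorted(DOCS, key=lambda d: (-value[d], d))  # ties broken by name
    return set(ranked[:capacity])


def tier_by_exhaustive_search(capacity):
    """Query-aware baseline: try every cache of the given size (toy sizes only)."""
    best = max(combinations(DOCS, capacity), key=lambda c: covered_traffic(set(c)))
    return set(best)


if __name__ == "__main__":
    for name, chooser in [("per-document value", tier_by_document_value),
                          ("exhaustive search", tier_by_exhaustive_search)]:
        cache = chooser(2)
        print(name, sorted(cache), covered_traffic(cache))
```

In this instance document "c" has the highest individual value because two queries use it, yet caching it alongside "a" serves no query completely, whereas caching "a" and "b" together serves the most frequent query. Exhaustive search is only feasible at toy scale, which is the gap an online flow-based solver is meant to close.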

Publication date: 2010